Dayhoff sequences, SPBC16H5_5, S64705, YAL5_SCHPO, CYP4_CAEEL, CELC34D4_7, CYPA_CAEEL, 
HUMORF006_1, CYPI_MYCTU, AF043642_1 and HSSRCYP_1.

EXAMPLE 41: Isolation of cDNA Clones Encoding Human PRO873

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1
above, wherein the consensus sequence obtained is herein designated DNA39621. Based on the DNA39621
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO873.

A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-AGGTGCCTGGAGGAGTGGTGnGG-3' (SEQ ID NO:255)
reverse PCR primer 5'-GGACGTGAGGAAGGGGAAGATGCG-3' (SEQ ID NO:256)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39621
sequence which had the following nucleotide sequence:
hybridization probe
5'-GAAGGGTACAAGTGGGTGGGGTTGAGCGAGGAGTGTGTGTAGGTG-3' (SEQ ID NO:257)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO873 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal liver tissue (LIB229).

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO873 [herein designated as UNQ440 (DNA44179-1362)] (SEQ ID NO:253) and the derived protein sequence
for PRO873.

The entire nucleotide sequence of UNQ440 (DNA44179-1362) is shown in Figure 90 (SEQ ID
NO:253). Clone UNQ440 (DNA44179-1362) contains a single open reading frame with an apparent translational
initiation site at nucleotide positions 139-141 and ending at the stop codon at nucleotide positions 1774-1776
(Figure 90). The predicted polypeptide precursor is 545 amino acids long (Figure 91). The full-length PRO873
protein shown in Figure 91 has an estimated molecular weight of about 58,934 daltons and a pI of about 9.45.
Analysis of the full-length PRO873 sequence shown in Figure 91 (SEQ ID NO:254) evidences the presence of
the following features: a signal peptide from about amino acid 1 to about amino acid 29; a carboxylesterase type-B
serine active site at about amino acid 312 to about amino acid 327; a carboxylesterase type-B signature 2 motif
at about amino acid 218 to about amino acid 228; and three potential N-glycosylation sites at about amino acid
318 to about amino acid 321, about amino acid 380 to about amino acid 383, and about amino acid 465 to about
amino acid 468. Clone UNQ440 (DNA44179-1362) has been deposited with ATCC on May 6, 1998 and is
assigned ATCC deposit no. 209851.
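
The reported coordinates can be cross-checked with simple arithmetic: the coding region runs from the first nucleotide of the initiation codon up to, but not including, the stop codon, and each codon encodes one amino acid. The following Python sketch is illustrative only and is not part of the original disclosure; the function name `orf_length_aa` is hypothetical.

```python
def orf_length_aa(start_codon_first_nt, stop_codon_first_nt):
    """Return the number of amino acids encoded by an open reading frame.

    Both arguments are 1-based nucleotide positions: the first base of the
    initiation codon and the first base of the stop codon. The stop codon
    itself is not translated, so it is excluded from the coding length.
    """
    coding_nt = stop_codon_first_nt - start_codon_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("coordinates do not describe an in-frame ORF")
    return coding_nt // 3


# Initiation codon at 139-141, stop codon at 1774-1776 (values from the text).
print(orf_length_aa(139, 1774))  # 545
```

The result agrees with the 545 amino acid precursor length stated above.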

Analysis of the amino acid sequence of the full-length PRO873 polypeptide suggests that it possesses
significant sequence similarity to a human liver carboxylesterase, thereby indicating that PRO873 may be a novel
carboxylesterase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced
significant homology between the PRO873 amino acid sequence and the following Dayhoff sequences:
ES10_RAT, GEN12405, AB010633_1, EST4_RAT, A48809, SASB_ANAPL, RNU41662_1, RNU22952_1,
BAL_RAT, GEN13522.

EXAMPLE 42: Isolation of cDNA Clones Encoding Human PRO940

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1
above, wherein the consensus sequence obtained is herein designated DNA47442. Based on the DNA47442
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO940.

A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-CAAAGr.CTGr.nr.rTnG,rrTCiTn.V (SEQ ID NO:260)
reverse PCR primer 5'-TTCTGGAGCCCAGAGGGTGCTGAG-3' (SEQ ID NO:262)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47442
sequence which had the following nucleotide sequence:
hybridization probe
5'-GGAGCTGCCACCCATTCAAATGGAGCACGAAGGAGAGTTCACCTG-3' (SEQ ID NO:263)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO940 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal liver tissue (LIB229).
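
The logic of screening with a primer pair can be illustrated in silico: a template yields a product only if the forward primer matches one strand and the reverse complement of the reverse primer is found downstream on that same strand. The Python sketch below is a simplified illustration under stated assumptions (exact matches only, a single template strand, toy sequences rather than the primers recited above); the helper names `reverse_complement` and `amplicon` are hypothetical and do not appear in the original disclosure.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def amplicon(template, forward, reverse):
    """Return the predicted PCR product, or None if the pair does not amplify.

    The forward primer must occur in the template, and the reverse primer
    must anneal downstream of it, i.e. its reverse complement must occur
    after the forward primer site on the same strand.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    target = reverse_complement(reverse)
    end = template.find(target, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(target)]


# Toy template and primers, chosen only to demonstrate the check.
toy_template = "TTTACGTACGGGCCCTTGACAAAA"
toy_forward = "ACGTAC"
toy_reverse = reverse_complement("TTGACA")
print(amplicon(toy_template, toy_forward, toy_reverse))
```

A library whose inserts return a product in such a check corresponds to a "positive library" in the sense used above; real screening also depends on annealing conditions and mismatch tolerance, which this sketch ignores.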

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO940 [herein designated as UNQ477 (DNA54002-1367)] (SEQ ID NO:258) and the derived protein sequence
for PRO940.

The entire nucleotide sequence of UNQ477 (DNA54002-1367) is shown in Figure 92 (SEQ ID
NO:258). Clone UNQ477 (DNA54002-1367) contains a single open reading frame with an apparent translational
initiation site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 1678-1680
(Figure 92). The predicted polypeptide precursor is 544 amino acids long (Figure 93). The full-length PRO940
protein shown in Figure 93 has an estimated molecular weight of about 60,268 daltons and a pI of about 9.53.
Analysis of the full-length PRO940 sequence shown in Figure 93 (SEQ ID NO:259) evidences the presence of
the following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites
from about amino acid 100 to about amino acid 103, from about amino acid 297 to about amino acid 300 and
from about amino acid 306 to about amino acid 309, and an immunoglobulin and major histocompatibility
complex signature sequence block from about amino acid 365 to about amino acid 371. Clone UNQ477
(DNA54002-1367) has been deposited with ATCC on April 7, 1998 and is assigned ATCC deposit no. 209754.

Analysis of the amino acid sequence of the full-length PRO940 polypeptide suggests that it possesses
significant sequence similarity to CD33 and the OB binding protein-2. More specifically, an analysis of the
Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the PRO940 amino acid
sequence and the following Dayhoff sequences: CD33_HUMAN, HSU71382_1, HSU71383_1, D86359_1,
PGBM_HUMAN, MAGS_MOUSE, D86983_1, C22B_HUMAN, P_W01002 and HVU24116_1.

EXAMPLE 43: Isolation of cDNA Clones Encoding Human PRO941

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1
above, wherein the consensus sequence obtained is herein designated DNA35941. An EST sequence proprietary
to Genentech was employed in the assembly and is herein designated DNA6415 (Figure 96; SEQ ID NO:265).
Based on the DNA35941 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA
library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding
sequence for PRO941.

A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-GTTGAGTGTGTGTGAATCTGGAGGG-3' (SEQ ID NO:266)
reverse PCR primer 5'-AAGTGGTGGAAGGGTGGAGTGTGG-3' (SEQ ID NO:267)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35941
sequence which had the following nucleotide sequence:
hybridization probe
5'-GCAGTAGGGTATTAGAGGAAAAGTrAAAAAGGATGATGGTTGGTGGAGGAGG-3' (SEQ ID NO:268)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO941 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227).

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO941 [herein designated as UNQ478 (DNA53906-1368)] (SEQ ID NO:263) and the derived protein sequence
for PRO941.

The entire nucleotide sequence of UNQ478 (DNA53906-1368) is shown in Figure 94 (SEQ ID
NO:263). Clone UNQ478 (DNA53906-1368) contains a single open reading frame with an apparent translational
initiation site at nucleotide positions 37-39 and ending at the stop codon at nucleotide positions 2353-2355
(Figure 94). The predicted polypeptide precursor is 772 amino acids long (Figure 95). The full-length PRO941
protein shown in Figure 95 has an estimated molecular weight of about 87,002 daltons and a pI of about 4.64.
Analysis of the full-length PRO941 sequence shown in Figure 95 (SEQ ID NO:264) evidences the presence of
the following: a signal peptide from about amino acid 1 to about amino acid 21, potential N-glycosylation sites
from about amino acid 57 to about amino acid 60, from about amino acid 74 to about amino acid 77, from about
amino acid 419 to about amino acid 422, from about amino acid 437 to about amino acid 440, from about amino
acid 508 to about amino acid 511, from about amino acid 515 to about amino acid 518, from about amino acid
516 to about amino acid 519 and from about amino acid 534 to about amino acid 537, and cadherin extracellular
repeated domain signature sequences from about amino acid 136 to about amino acid 146 and from about amino
acid 244 to about amino acid 254. Clone UNQ478 (DNA53906-1368) has been deposited with ATCC on April
7, 1998 and is assigned ATCC deposit no. 209747.
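
Potential N-glycosylation sites of the kind listed above are conventionally predicted by scanning for the four-residue PROSITE pattern N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). The Python sketch below is illustrative only; it is an assumption that this pattern was the one applied here, the function name `find_sequons` is hypothetical, and the peptide is a toy string rather than the PRO941 sequence. A lookahead is used so that overlapping sites, such as the adjacent sites reported at 515-518 and 516-519, are both found.

```python
import re

# PROSITE-style pattern N-{P}-[ST]-{P}, wrapped in a lookahead so that
# overlapping matches are all reported.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sequons(protein):
    """Return 1-based (start, end) positions of potential N-glycosylation sites."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein.upper())]


# Toy peptide: two overlapping sites (NNST, NSTA) and one excluded by proline (NPTA).
print(find_sequons("AANNSTAANPTA"))  # [(3, 6), (4, 7)]
```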

Analysis of the amino acid sequence of the full-length PRO941 polypeptide suggests that it possesses
significant sequence similarity to a cadherin protein, thereby indicating that PRO941 may be a novel cadherin
protein family member. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35)
evidenced significant homology between the PRO941 amino acid sequence and the following Dayhoff sequences.


